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A structure this pretty just had to exist. 

-James Watson, The Double Helix, 1968 

N ucleotides have a variety of roles in cellular metab¬ 
olism. They are the energy currency in metabolic 
transactions, the essential chemical links in the re¬ 
sponse of cells to hormones and other extracellular stim¬ 
uli, and the structural components of an array of en¬ 
zyme cofactors and metabolic intermediates. And, last 
but certainly not least, they are the constituents of nu¬ 
cleic acids: deoxyribonucleic acid (DNA) and ribonu¬ 
cleic acid (RNA), the molecular repositories of genetic 
information. The structure of every protein, and ulti¬ 
mately of every biomolecule and cellular component, is 
a product of information programmed into the nu¬ 
cleotide seguence of a cell's nucleic acids. The ability to 
store and transmit genetic information from one gener¬ 
ation to the next is a fundamental condition for life. 

This chapter provides an overview of the chemical 
nature of the nucleotides and nucleic acids found in 
most cells; a more detailed examination of the function 
of nucleic acids is the focus of Part III of this text. 

8.1 Some Basics 

Nucleotides, Building Blocks of Nucleic Acids The amino add 
seguence of every protein in a cell, and the nucleotide 
seguence of every RNA, is specified by a nucleotide se¬ 


guence in the cell's DNA. A segment of a DNA molecule 
that contains the information required for the synthesis 
of a functional biological product, whether protein or 
RNA, is referred to as a gene. A cell typically has many 
thousands of genes, and DNA molecules, not surpris¬ 
ingly, tend to be very large. The storage and transmis¬ 
sion of biological information are the only known func¬ 
tions of DNA. 

RNAs have a broader range of functions, and sev¬ 
eral classes are found in cells. Ribosomal RNAs 
(rRNAs) are components of ribosomes, the complexes 
that carry out the synthesis of proteins. Messenger 
RNAs (mRNAs) are intermediaries, carrying genetic 
information from one or a few genes to a ribosome, 
where the corresponding proteins can be synthesized. 
Transfer RNAs (tRNAs) are adapter molecules that 
faithfully translate the information in mRNA into a 
specific sequence of amino acids. In addition to these 
major classes there is a wide variety of RNAs with spe¬ 
cial functions, described in depth in Part III. 

Nucleotides and Nucleic Acids Have Characteristic 
Bases and Pentoses 

Nucleotides have three characteristic components: 
(1) a nitrogenous (nitrogen-containing) base, (2) a pen¬ 
tose, and (3) a phosphate (F ig. 8-1). The molecule with¬ 
out the phosphate group is called a nucleoside. The 
nitrogenous bases are derivatives of two parent com¬ 
pounds, pyrimidine and purine. The bases and pentoses 
of the common nucleotides are heterocyclic compounds. 
The carbon and nitrogen atoms in the parent structures 
are conventionally numbered to facilitate the naming 
and identification of the many derivative compounds. 
The convention for the pentose ring follows rules out¬ 
lined in Chapter 7, but in the pentoses of nucleotides 
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FIGURE 8-1 Structure of nucleotides, (a) General structure showing 
the numbering convention for the pentose ring. This is a ribonu¬ 
cleotide. In deoxyribonucleotides the —OH group on the 2' carbon 
(in red) is replaced with —H. (b)The parent compounds of the pyrim¬ 
idine and purine bases of nucleotides and nucleic acids, showing the 
numbering conventions. 
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FIGURE 8-2 Major purine and pyrimidine bases of nucleic acids. 

Some of the common names of these bases reflect the circumstances 
of their discovery. Guanine, for example, was first isolated from guano 
(bird manure), and thymine was first isolated from thymus tissue. 


and nucleosides the carbon numbers are given a prime 
(') designation to distinguish them from the numbered 
atoms of the nitrogenous bases. 

The base of a nucleotide is joined covalently (at N-l 
of pyrimidines and N-9 of purines) in an A/-j3-glycosyl 
bond to the V carbon of the pentose, and the phosphate 
is esterified to the 5' carbon. The /V-j3-glycosyl bond is 
formed by removal of the elements of water (a hydroxyl 
group from the pentose and hydrogen from the base), 
as in O-glycosidic bond formation (see Fig. 7-31). 

Both DNA and RNA contain two major purine bases, 
adenine (A) and guanine (G), and two major pyrim¬ 
idines. In both DNA and RNA one of the pyrimidines is 
cytosine (C), but the second major pyrimidine is not 
the same in both: it is thymine (T) in DNA and uracil 
(U) in RNA. Only rarely does thymine occur in RNA or 
uracil in DNA. The structures of the five major bases 
are shown in F igure 8-2, and the nomenclature of their 
corresponding nucleotides and nucleosides is summa¬ 
rized in Table 8-1. 

Nucleic acids have two kinds of pentoses. The re¬ 
curring deoxyribonucleotide units of DNA contain 2'- 
deoxy-D-ribose, and the ribonucleotide units of RNA 
contain o-ribose. In nucleotides, both types of pentoses 
are in their /3-furanose (closed five-membered ring) 
form. As Figure 8-3 shows, the pentose ring is not pla¬ 
nar but occurs in one of a variety of conformations gen¬ 
erally described as "puckered." 

Figure 8-4 gives the structures and names of the 
four major deoxyribonucleotides (deoxyribonucleo- 
side 5'-monophosphates), the structural units of DNAs, 
and the four major ribonucleotides (ribonucleoside 5'- 
monophosphates), the structural units of RN As. Specific 


long sequences of A, T, G, and C nucleotides in DNA are 
the repository of genetic information. 

Although nucleotides bearing the major purines and 
pyrimidines are most common, both DNA and RNA also 


(■> 





H—fj— OH 
H-C-OH 
H— OH 

GHjOH 

Alddhydc- 



OH 



OH OH 




m ii>. 


C-3 r uk*j 


liLL-ik!- 


J5 


C-S" 4Mi» 


FIGURE 8-3 Conformations of ribose. (a) In solution, the straight- 
chain (aldehyde) and ring (/3-furanose) forms of free ribose are in equi¬ 
librium. RNA contains only the ring form, /3-D-ribofuranose. Deoxy- 
ribose undergoes a similar interconversion in solution, but in DNA 
exists solely as /3-2'-deoxy-D-ribofuranose. (b) Ribofuranose rings in 
nucleotides can exist in four different puckered conformations. In all 
cases, four of the five atoms are in a single plane. The fifth atom 
(C-2' or C-3') is on either the same (endo) or the opposite (exo) side 
of the plane relative to the C-5' atom. 
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Nucleotide: Deoxyadenylate 

(deoxyadenosi ne 
5'-monophosphate) 


Deoxyguanylate 

(deoxyguanosine 

5'-monophosphate) 


Deoxythymidylate 

(deoxythymidine 

5'-monophosphate) 


Deoxycytidyl ate 
(deoxycytidine 
5'-monophosphate) 


Symbols: A, dA, dAMP 


G, dG, dGMP 


T, dT, dTMP 


C, dC, dCMP 


Nucleoside: Deoxyadenosi ne 


Deoxyguanosine 


Deoxythymidine 


Deoxycytidine 


(a) Deoxyribonucleotides 



Nucleotide: Adenylate (adenosine 

5'-monophosphate) 


Guanylate (guanosine 
5'-monophosphate) 


Uridyl ate (uridine 
5'-monophosphate) 


Cytidylate (cytidine 
5'-monophosphate) 


Symbols: A, AMP 


G, GMP 


U, UMP 


C, CMP 


Nucleoside: Adenosine 


Guanosine 


Uridine 


Cytidine 


(b) Ribonucleotides 


FIGURE 8-4 Deoxy ribonucleotides and ribonucleotides of nucleic 
acids. All nucleotides are shown in their free form at pH 7.0. The nu¬ 
cleotide units of DNA (a) are usually symbolized as A, G, T, and C, 
sometimes as dA, dG, dT, and dC; those of RNA (b) as A, G, U, and 
C. In their free form the deoxyribonucleotides are commonly abbre¬ 
viated dAMP, dGMP, dTMP, and dCMP; the ribonucleotides, AMP, 


GMP, UMP, and CMP. For each nucleotide, the more common name 
is followed by the complete name in parentheses. All abbreviations 
assume that the phosphate group is at the 5' position. The nucleoside 
portion of each molecule is shaded in light red. In this and the fol¬ 
lowing illustrations, the ring carbons are not shown. 


TABLE 8-1 Nucleotide and Nucleic Acid Nomenclature 


Base 

Nucleoside 

Nucleotide 

Nucleic acid 

Purines 

Adenine 

Adenosine 

Adenylate 

RNA 


Deoxyadenosine 

Deoxyadenylate 

DNA 

Guanine 

Guanosine 

Guanylate 

RNA 


Deoxyguanosine 

Deoxyguanylate 

DNA 

Pyrimidines 

Cytosine 

Cytidine 

Cytidylate 

RNA 


Deoxycytidine 

Deoxycytidylate 

DNA 

Thymine 

Thymidine or deoxythymidine 

Thymidylate or deoxythymidylate 

DNA 

Uracil 

Uridine 

Uridylate 

RNA 


Note: "Nucleoside” and "nucleotide” are 
generic terms that include both ribo- and 
deoxyribo- forms. Also, ribonucleosides and 
ribonucleotides are here designated simply 
as nucleosides and nucleotides (e.g., ribo- 
adenosine as adenosine), and deoxyribo- 
nucleosides and deoxyribonucleotides as 
deoxynucleosides and deoxynucleotides 
(e.g., deoxyriboadenosine as deoxyadeno- 
sine). Both forms of naming are accept¬ 
able, but the shortened names are more 
commonly used. Thymine is an exception; 
"ribothymidine” is used to describe its 
unusual occurrence in RNA. 
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FIGURE 8-5 Some minor purine and pyrimidine bases, shown as the 
nucleosides, (a) Minor bases of DNA. 5-Methylcytidine occurs in the 
DNA of animals and higher plants, A/ 6 -methyladenosine in bacterial 
DNA, and 5-hydroxymethylcytidine in the DNA of bacteria infected 
with certain bacteriophages, (b) Some minor bases of tRNAs. Inosine 
contains the base hypoxanthine. Note that pseudouridine, like uridine, 
contains uracil; they are distinct in the point of attachment to the 
ribose— in uridine, uracil is attached through N-l, the usual attach¬ 
ment point for pyrimidines; in pseudouridine, through C-5. 


here) is simply to indicate the ring position of the sub¬ 
stituent by its number—for example, 5-methylcytosine, 

7- methylguanine, and 5-hydroxymethylcytosine (shown 
as the nucleosides in Fig. 8-5). The element to which 
the substituent is attached (N, C, 0) is not identified. 
The convention changes when the substituted atom is 
exocyclic (not within the ring structure), in which case 
the type of atom is identified and the ring position to 
which it is attached is denoted with a superscript. The 
amino nitrogen attached to C-6 of adenine is A/ 6 ; simi¬ 
larly, the carbonyl oxygen and amino nitrogen at C-6 
and C-2 of guanine are O 6 and A/ 2 , respectively. Examples 
of this nomenclature are A/ 6 -methyladenosine and A/ 2 - 
methylguanosine (Fig. 8-5). 

Cells also contain nucleotides with phosphate 
groups in positions other than on the 5' carbon (Fig. 

8 - 6 ) Ribonucleoside 2',3'-cyclic monophosphates 
are isolatable intermediates, and ribonucleoside 3'- 
monophosphates are end products of the hydrolysis 
of RNA by certain ribonucleases. Other variations are 
adenosine 3',5'-cyclic monophosphate (cAMP) and 
guanosine 3',5'-cyclic monophosphate (cGMP), consid¬ 
ered at the end of this chapter. 

Phosphodiester Bonds Link Successive Nucleotides 
in Nucleic Acids 

The successive nucleotides of both DNA and RNA are 
covalently linked through phosphate-group "bridges," in 
which the 5'-phosphate group of one nucleotide unit is 
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contain some minor bases (Fig. 8-5). In DNA the most 
common of these are methylated forms of the major 
bases; in some viral DNAs, certain bases may be hy- 
droxymethylated or glucosylated. Altered or unusual 
bases in DNA molecules often have roles in regulating 
or protecting the genetic information. Minor bases of 
many types are also found in RNAs, especially in tRNAs 
(see Fig. 26-24). 

The nomenclature for the minor bases can be con¬ 
fusing. Like the major bases, many have common names— 
hypoxanthine, for example, shown as its nucleoside ino¬ 
sine in Figure 8-5. When an atom in the purine or 
pyrimidine ring is substituted, the usual convention (used 
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FIGURE 8-6 Some adenosine monophosphates. Adenosine 2- 
monophosphate, 3'-monophosphate, and 2',3'-cyclic monophosphate 
are formed by enzymatic and alkaline hydrolysis of RNA. 
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FIGURE 8-7 Phosphodiester linkages in the covalent backbone of 

DNA and RNA. The phosphodiester bonds (one of which is shaded in 
the DNA) link successive nucleotide units. The backbone of alternat¬ 
ing pentose and phosphate groups in both types of nucleic acid is 
highly polar. The 5' end of the macromolecule lacks a nucleotide at 
the 5' position, and the 3' end lacks a nucleotide at the 3' position. 


joined to the 3'-hydroxyl group of the next nucleotide, 
creating a phosphodiester linkage (Fig. 8-7). Thus 
the covalent backbones of nucleic acids consist of al¬ 
ternating phosphate and pentose residues, and the ni¬ 
trogenous bases may be regarded as side groups joined 
to the backbone at regular intervals. The backbones of 
both DNA and RNA are hydrophilic. The hydroxyl 
groups of the sugar residues form hydrogen bonds with 
water. The phosphate groups, with a p K a near 0, are 
completely ionized and negatively charged at pH 7, and 
the negative charges are generally neutralized by ionic 
interactions with positive charges on proteins, metal 
ions, and polyamines. 

All the phosphodiester linkages have the same ori¬ 
entation along the chain (Fig. 8-7), giving each linear 
nucleic acid strand a specific polarity and distinct 5' and 
3' ends. By definition, the 5' end lacks a nucleotide at 
the 5' position and the 3' end lacks a nucleotide at the 
3' position. Other groups (most often one or more phos¬ 
phates) may be present on one or both ends. 

The covalent backbone of DNA and RNA is subject 
to slow, nonenzymatic hydrolysis of the phosphodiester 
bonds, in the test tube, RNA is hydrolyzed rapidly un¬ 
der alkaline conditions, but DNA is not; the 2'-hydroxyI 
groups in RNA (absent in DNA) are directly involved in 
the process. Cyclic 2',3'-monophosphate nucleotides 
are the first products of the action of alkali on RNA and 
are rapidly hydrolyzed further to yield a mixture of 2'- 
and 3'-nucleoside monophosphates (Fig. 8-8). 

The nucleotide sequences of nucleic acids can be 
represented schematically, as illustrated on the follow¬ 
ing page by a segment of DNA with five nucleotide units. 
The phosphate groups are symbolized by ©, and each 
deoxyribose is symbolized by a vertical line, from C-l' 
at the top to C-5' at the bottom (but keep in mind that 
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FIGURE 8-8 Hydrolysis of RNA under alkaline 
conditions. The 2' hydroxyl acts as a nucleophile 
in an intramolecular displacement. The 2',3'-cyclic 
monophosphate derivative is further hydrolyzed to 
a mixture of 2'- and 3'-monophosphates. DNA, 
which lacks 2' hydroxyls, is stable under similar 
conditions. 
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the sugar is always in its closed-ring /3-furanose form in 
nucleic acids). The connecting lines between nucleotides 
(which pass through ©) are drawn diagonally from the 
middle (C-3') of the deoxyribose of one nucleotide to 
the bottom (C-5') of the next. 
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FIGURE 8-9 Tautomeric forms of uracil. The lactam form predomi¬ 
nates at pH 7.0; the other forms become more prominent as pH de¬ 
creases. The other free pyrimidines and the free purines also have tau¬ 
tomeric forms, but they are more rarely encountered. 


By convention, the structure of a single strand of nu¬ 
cleic acid is always written with the 5' end at the left 
and the 3' end at the right—that is, in the 5' —> 3' di¬ 
rection. Some simpler representations of this pentade- 
oxyribonucleotide are pA-C-G-T-A 0 H. pApCpGpTpA, 
and pACGTA. 

A short nucleic acid is referred to as an oligonu¬ 
cleotide. The definition of "short" is somewhat arbi¬ 
trary, but polymers containing 50 or fewer nucleotides 
are generally called oligonucleotides. A longer nucleic 
acid is called a polynucleotide. 

The Properties of Nucleotide Bases Affect 
the Three-Dimensional Structure of Nucleic Acids 

Free pyrimidines and purines are weakly basic com¬ 
pounds and are thus called bases. They have a variety 
of chemical properties that affect the structure, and 
ultimately the function, of nucleic acids. The purines 
and pyrimidines common in DNA and RNA are highly 
conjugated molecules (Fig. 8-2), a property with im¬ 
portant conseguences for the structure, electron distri¬ 
bution, and light absorption of nucleic acids. Resonance 
among atoms in the ring gives most of the bonds par¬ 
tial double-bond character. One result is that pyrim¬ 
idines are planar molecules; purines are very nearly 


planar, with a slight pucker. F ree pyrimidine and purine 
bases may exist in two or more tautomeric forms de¬ 
pending on the pH. Uracil, for example, occurs in lac¬ 
tam, lactim, and double lactim forms (Fig. 8-9). The 
structures shown in Figure 8-2 are the tautomers that 
predominate at pH 7.0. As a result of resonance, all nu¬ 
cleotide bases absorb UV light, and nucleic acids are 
characterized by a strong absorption at wavelengths 
near 260 nm (F ig. 8-10). 

The purine and pyrimidine bases are hydrophobic 
and relatively insoluble in water at the near-neutral pH 
of the cell. At acidic or alkaline pH the bases become 
charged and their solubility in water increases. Hy¬ 
drophobic stacking interactions in which two or more 
bases are positioned with the planes of their rings par¬ 
allel (like a stack of coins) are one of two important 
modes of interaction between bases in nucleic acids. The 
stacking also involves a combination of van der Waals 
and dipole-dipole interactions between the bases. Base 
stacking helps to minimize contact of the bases with wa¬ 
ter, and base-stacking interactions are very important in 
stabilizing the three-dimensional structure of nucleic 
acids, as described later. 
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FIGURE 8-10 Absorption spectra of the 
common nucleotides. The spectra are 
shown as the variation in molar extinction 
coefficient with wavelength. The molar 
extinction coefficients at 260 nm and 
pH 7.0 (e 2 6o) are listed in the table. The 
spectra of corresponding ribonucleotides 
and deoxyribonucleotides, as well as the 
nucleosides, are essentially identical. For 
mixtures of nucleotides, a wavelength of 
260 nm (dashed vertical line) is used for 
absorption measurements. 
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FIGURE 8-11 Hydrogen-bonding patterns in the base pairs defined 
by Watson and Crick. Here as elsewhere, hydrogen bonds are 
represented by three blue lines. 


The most important functional groups of pyrim¬ 
idines and purines are ring nitrogens, carbonyl groups, 
and exocyclic amino groups. Hydrogen bonds involving 
the amino and carbonyl groups are the second impor¬ 
tant mode of interaction between bases in nucleic acid 
molecules. Hydrogen bonds between bases permit a 
complementary association of two (and occasionally 
three or four) strands of nucleic acid. The most impor¬ 
tant hydrogen-bonding patterns are those defined by 
J ames D. Watson and F rancis Crick in 1953, in which A 
bonds specifically to T (or U) and G bonds to C (Fig. 
8-11). These two types of base pairs predominate in 
double-stranded DNA and RNA, and the tautomers 
shown in Figure 8-2 are responsible for these patterns. 
It is this specific pairing of bases that permits the du¬ 
plication of genetic information, as we shall discuss later 
in this chapter. 


SUMMARY 8.1 Some Basics 


■ A nucleotide consists of a nitrogenous base 

(purine or pyrimidine), a pentose sugar, and 
one or more phosphate groups. Nucleic acids 
are polymers of nucleotides, joined together by 
phosphodiester linkages between the 5'- 
hydroxyl group of one pentose and the 3'- 
hydroxyl group of the next. 

■ There are two types of nucleic acid: RNA and 
DNA. The nucleotides in RNA contain ribose, 
and the common pyrimidine bases are uracil 
and cytosine. In DNA, the nucleotides contain 
2'-deoxyribose, and the common pyrimidine 
bases are thymine and cytosine. The primary 
purines are adenine and guanine in both RNA 
and DNA. 



James Watson Francis Crick 


8.2 Nucleic Acid Structure 

The discovery of the structure of DNA by Watson and 
Crick in 1953 was a momentous event in science, an 
event that gave rise to entirely new disciplines and in¬ 
fluenced the course of many established ones. Our pres¬ 
ent understanding of the storage and utilization of a 
cell's genetic information is based on work made possi¬ 
ble by this discovery, and an outline of how genetic in¬ 
formation is processed by the cell is now a prerequisite 
for the discussion of any area of biochemistry. Here, we 
concern ourselves with DNA structure itself, the events 
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that led to its discovery, and more recent refinements 
in our understanding. RNA structure is also introduced. 

As in the case of protein structure (Chapter 4), it 
is sometimes useful to describe nucleic acid structure 
in terms of hierarchical levels of complexity (primary, 
secondary, tertiary). The primary structure of a nucleic 
acid is its covalent structure and nucleotide sequence. 
Any regular, stable structure taken up by some or all of 
the nucleotides in a nucleic acid can be referred to as 
secondary structure. All structures considered in the re¬ 
mainder of this chapter fall under the heading of sec¬ 
ondary structure. The complex folding of large chro¬ 
mosomes within eukaryotic chromatin and bacterial 
nucleoids is generally considered tertiary structure; this 
is discussed in Chapter 24. 

DNA Stores Genetic Information 

The biochemical investigation of DNA began with 
F riedrich M iescher, who carried out the first systematic 
chemical studies of cell nuclei. In 1868 M iescher isolated 
a phosphorus-containing substance, which he called 
"nuclein," from the nuclei of pus cells (leukocytes) ob¬ 
tained from discarded surgical bandages. Fie found 
nuclein to consist of an acidic portion, which we know 
today as DNA, and a basic portion, protein. Miescher 
later found a similar acidic substance in the heads of 
sperm cells from salmon. Although he partially purified 
nuclein and studied its properties, the covalent (pri¬ 
mary) structure of DNA (as shown in Fig. 8-7) was not 
known with certainty until the late 1940s. 

Miescher and many others suspected that nuclein 
(nucleic acid) was associated in some way with cell in¬ 
heritance, but the first direct evidence that DNA is the 
bearer of genetic information came in 1944 through a 
discovery made by Oswald T. Avery, Colin MacLeod, and 
Maclyn McCarty. These investigators found that DNA 
extracted from a virulent (disease-causing) strain of the 
bacterium Streptococcus pneumoniae, also known as 
pneumococcus, genetically transformed a nonvirulent 
strain of this organism into a virulent form (Fig. 8-12). 


FIGURE 8-12 The Avery-MacLeod-McCarty experiment, (a) When 
injected into mice, the encapsulated strain of pneumococcus is lethal, 
(b) whereas the nonencapsulated strain, (c) like the heat-killed en¬ 
capsulated strain, is harmless, (d) Earlier research by the bacteriolo¬ 
gist Frederick Griffith had shown that adding heat-killed virulent bac¬ 
teria (harmless to mice) to a live nonvirulent strain permanently 
transformed the latter into lethal, virulent, encapsulated bacteria, 
(e) Avery and his colleagues extracted the DNA from heat-killed vir¬ 
ulent pneumococci, removing the protein as completely as possible, 
and added this DNA to nonvirulent bacteria. The DNA gained en¬ 
trance into the nonvirulent bacteria, which were permanently trans¬ 
formed into a virulent strain. 
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32 P experiment 


35 S experiment 


Avery and his colleagues concluded that the DNA ex¬ 
tracted from the virulent strain carried the inheritable ge¬ 
netic message for virulence. Not everyone accepted these 
conclusions, because protein impurities present in the 
DNA could have been the carrier of the genetic informa¬ 
tion. This possibility was soon eliminated by the finding 
that treatment of the DNA with proteolytic enzymes did 
not destroy the transforming activity, but treatment with 
deoxyribonucleases (DNA-hydrolyzing enzymes) did. 

A second important experiment provided inde¬ 
pendent evidence that DNA carries genetic information. 
In 1952 Alfred D. Hershey and Martha Chase used ra¬ 
dioactive phosphorus ( 32 P) and radioactive sulfur ( 35 S) 
tracers to show that when the bacterial virus (bacterio¬ 
phage) T2 infects its host cell, Escherichia coli, it is 
the phosphorus-containing DNA of the viral particle, not 
the sulfur-containing protein of the viral coat, that en¬ 
ters the host cell and furnishes the genetic information 
for viral replication (Fig. 8-13). These important early 
experiments and many other lines of evidence have 
shown that DNA is the exclusive chromosomal compo¬ 
nent bearing the genetic information of living cells. 

DNA Molecules Have Distinctive Base Compositions 

A most important clue to the structure of DNA came 
from the work of Erwin Chargaff and his colleagues in 
the late 1940s. They found that the four nucleotide 
bases of DNA occur in different ratios in the DNAs of 
different organisms and that the amounts of certain 
bases are closely related. These data, collected from 
DNAs of a great many different species, led Chargaff to 
the following conclusions: 

1. The base composition of DNA generally varies 
from one species to another. 

2. DNA specimens isolated from different tissues of 
the same species have the same base composition. 

3. The base composition of DNA in a given species 
does not change with an organism's age, 
nutritional state, or changing environment. 

4. In all cellular DNAs, regardless of the species, the 
number of adenosine residues is equal to the 
number of thymidine residues (that is, A = T), 
and the number of guanosine residues is equal to 
the number of cytidine residues (G = C). From 
these relationships it follows that the sum of the 
purine residues equals the sum of the pyrimidine 
residues; that is, A + G = T + C. 

These quantitative relationships, sometimes called 
"Chargaff's rules," were confirmed by many subsequent 
researchers. They were a key to establishing the three- 
dimensional structure of DNA and yielded clues to how 
genetic information is encoded in DNA and passed from 
one generation to the next. 
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FIGURE 8-13 The Hershey-Chase experiment. Two batches of iso- 
topically labeled bacteriophageT2 particles were prepared. One was 
labeled with 32 P in the phosphate groups of the DNA, the other with 
35 S in the sulfur-containing amino acids of the protein coats (capsids). 
(Note that DNA contains no sulfur and viral protein contains no phos¬ 
phorus.) The two batches of labeled phage were then allowed to in¬ 
fect separate suspensions of unlabeled bacteria. Each suspension of 
phage-infected cells was agitated in a blender to shear the viral cap¬ 
sids from the bacteria. The bacteria and empty viral coats (called 
“ghosts") were then separated by centrifugation. The cells infected with 
the 32 P-labeled phage were found to contain 32 P, indicating that the 
labeled viral DNA had entered the cells; the viral ghosts contained no 
radioactivity. The cells infected with 35 S-labeled phage were found to 
have no radioactivity after blender treatment, but the viral ghosts con¬ 
tained 35 S. Progeny virus particles (not shown) were produced in both 
batches of bacteria some time after the viral coats were removed, in¬ 
dicating that the genetic message for their replication had been in¬ 
troduced by viral DNA, not by viral protein. 
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DNA Is a Double Helix 

To shed more light on the structure of DNA, Rosalind 
Franklin and Maurice Wilkins used the powerful method 
of x-ray diffraction (see Box 4-4) to analyze DNA fibers. 
They showed in the early 1950s that DNA produces a 
characteristic x-ray diffraction pattern (Fig. 8-14). 
From this pattern it was deduced that DNA molecules 
are helical with two periodicities along their long axis, 
a primary one of 3.4 A and a secondary one of 34 A. The 
problem then was to formulate a three-dimensional 
model of the DNA molecule that could account not only 
for the x-ray diffraction data but also for the spe¬ 
cific A = T and G = C base equivalences discovered by 
Chargaff and for the other chemical properties of DNA. 

In 1953 Watson and Crick postulated a three- 
dimensional model of DNA structure that accounted for 
all the available data. It consists of two helical DNA 
chains wound around the same axis to form a right- 
handed double helix (see Box 4-1 for an explanation of 
the right- or left-handed sense of a helical structure). 
The hydrophilic backbones of alternating deoxyribose 
and phosphate groups are on the outside of the double 
helix, facing the surrounding water. The furanose ring 
of each deoxyribose is in the C-2' endo conformation. 
The purine and pyrimidine bases of both strands are 
stacked inside the double helix, with their hydrophobic 
and nearly planar ring structures very close together 
and perpendicular to the long axis. The offset pairing of 
the two strands creates a major groove and minor 
groove on the surface of the duplex (Fig. 8-15). Each 
nucleotide base of one strand is paired in the same plane 
with a base of the other strand. Watson and Crick found 
that the hydrogen-bonded base pairs illustrated in Fig¬ 
ure 8-11, G with C and A with T, are those that fit best 
within the structure, providing a rationale for Chargaff’s 
rule that in any DNA, G = C and A = T. It is important 
to note that three hydrogen bonds can form between G 
and C, symbolized G=C, but only two can form between 
A and T, symbolized A=T. This is one reason for the 



FIGURE 8-14 X-ray diffraction pattern of DNA. The spots forming a 
cross in the center denote a helical structure. The heavy bands at the 
left and right arise from the recurring bases. 




Rosalind Franklin, Maurice Wilkins 

1920-1958 

finding that separation of paired DNA strands is more 
difficult the higher the ratio of G=C to A=T base pairs. 
Other pairings of bases tend (to varying degrees) to 
destabilize the double-helical structure. 

When Watson and Crick constructed their model, 
they had to decide at the outset whether the strands 
of DNA should be parallel or antiparallel— whether 
their 5',3'-phosphodiester bonds should run in the same 
or opposite directions. An antiparallel orientation pro¬ 
duced the most convincing model, and later work with 
DNA polymerases (Chapter 25) provided experimental 
evidence that the strands are indeed antiparallel, a find¬ 
ing ultimately confirmed by x-ray analysis. 

To account for the periodicities observed in the x- 
ray diffraction patterns of DNA fibers, Watson and Crick 
manipulated molecular models to arrive at a structure 


: "j 






: a.iA 


M .tiftf . 


r 


y 


Tit 




fli. A 


gnxwiV ■ 


luZuri 


y ^ 


Xm T| 


. 


%-■ - 





•a V 


afiA 


(*> 




FIGURE 8-15 Watson-Crick model for the structure of DNA. The 

original model proposed by Watson and Crick had 10 base pairs, or 
34 A (3.4 nm), per turn of the helix; subsequent measurements revealed 
10.5 base pairs, or 36 A (3.6 nm), per turn, (a) Schematic represen¬ 
tation, showing dimensions of the helix, (b) Stick representation show¬ 
ing the backbone and stacking of the bases, (c) Space-filling model. 















8.2 Nucleic Acid Structure 


283 


in which the vertically stacked bases inside the double 
helix would be 3.4 A apart; the secondary repeat dis¬ 
tance of about 34 A was accounted for by the presence 
of 10 base pairs in each complete turn of the double 
helix. In aqueous solution the structure differs slightly 
from that in fibers, having 10.5 base pairs per helical 
turn (Fig 8-15). 

As Figure 8-16 shows, the two antiparallel polynu¬ 
cleotide chains of double-helical DNA are not identical 
in either base sequence or composition. Instead they are 
complementary to each other. Wherever adenine oc¬ 
curs in one chain, thymine is found in the other; simi¬ 
larly, wherever guanine occurs in one chain, cytosine is 
found in the other. 

The DNA double helix, or duplex, is held together 
by two forces, as described earlier: hydrogen bonding 
between complementary base pairs (Fig. 8-11) and 
base-stacking interactions. The complementarity be¬ 
tween the DNA strands is attributable to the hydrogen 
bonding between base pairs. The base-stacking interac¬ 
tions, which are largely nonspecific with respect to the 
identity of the stacked bases, make the major contribu¬ 
tion to the stability of the double helix. 

The important features of the double-helical model 
of DNA structure are supported by much chemical and 
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FIGURE 8-16 Complementarity of strands in the DNA double helix. 

The complementary antiparallel strands of DNA follow the pairing 
rules proposed by Watson and Crick. The base-paired antiparallel 
strands differ in base composition: the left strand has the composition 
A 3 T 2 Gi C 3 ; the right, A 2 T 3 G 3 Ci.They also differ in sequence when 
each chain is read in the 5' 3' direction. Note the base equiva¬ 

lences: A = T and G = C in the duplex. 



W* ' r y 


JF 




■-j, nib. 

ft 

_ .a 

i j r rx.ii| 


|. i.TCiTJpn 


/JPPI i; t ■ ’I p'W. 


y 


1VP4 


h p 


\ ■ 





OTjfl 

New 




ii m. 





1A "t* 
F~ m 


w 


■ ■ MpAl 

| 'i Vi tWHI 
JpBe 





.yfAjrf'l-t-ppfek 

■n \ L I- 'WrMl 

TCI «.'■■■“ i— mu 
i ^ i 

w- 1"+' 



Sfea 





4 


gS*\ 

Li . SL 

npBFi ■ 


x Vi 



■■■ ■tii . , ■, 

C. h -3 

HFTf 

rjmwr 

. ^ ■’W 


i 

Fual 
Prim ii 


r \ 





.p 


ri.il PfaHIli 

T*fii ii* ?il:TiU". :l 


FIGURE 8-17 Replication of DNA as suggested by Watson and Crick. 

The preexisting or "parent" strands become separated, and each is the 
template for biosynthesis of a complementary "daughter" strand (in red). 


biological evidence. Moreover, the model immediately 
suggested a mechanism for the transmission of genetic 
information. The essential feature of the model is the 
complementarity of the two DNA strands. As Watson and 
Crick were able to see, well before confirmatory data be¬ 
came available, this structure could logically be replicated 
by (1) separating the two strands and (2) synthesizing 
a complementary strand for each. Because nucleotides 
in each new strand are joined in a sequence specified by 
the base-pairing rules stated above, each preexisting 
strand functions as a template to guide the synthesis of 
one complementary strand (Fig. 8-17). These expecta¬ 
tions were experimentally confirmed, inaugurating a rev¬ 
olution in our understanding of biological inheritance. 

DNA Can Occur in Different Three-Dimensional Forms 

DNA is a remarkably flexible molecule. Considerable ro¬ 
tation is possible around a number of bonds in the 
sugar-phosphate (phosphodeoxyribose) backbone, and 
thermal fluctuation can produce bending, stretching, and 
unpairing (melting) of the strands. Many significant de¬ 
viations from the Watson-Crick DNA structure are found 
in cellular DNA, some or all of which may play impor¬ 
tant roles in DNA metabolism. These structural varia¬ 
tions generally do not affect the key properties of DNA 
defined by Watson and Crick: strand complementarity, 
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FIGURE 8-18 Structural variation in DNA. (a) The conformation of a nucleotide in 
DNA is affected by rotation about seven different bonds. Six of the bonds rotate freely. 
The limited rotation about bond 4 gives rise to ring pucker, in which one of the atoms in 
the five-membered furanose ring is out of the plane described by the other four. This 
conformation is endo or exo, depending on whether the atom is displaced to the same 
side of the plane as C-5' or to the opposite side (see Fig. 8-3b). (b) For purine bases in 
nucleotides, only two conformations with respect to the attached ribose units are 
sterically permitted, anti or syn. Pyrimidines generally occur in the anti conformation. 


antiparallel strands, and the requirement for A=T and 
G=C base pairs. 

Structural variation in DNA reflects three things: 
the different possible conformations of the deoxyribose, 
rotation about the contiguous bonds that make up the 
phosphodeoxyribose backbone (Fig. 8-18a), and free 
rotation about the C-l'-W-glycosyl bond (Fig. 8- 18b). 
Because of steric constraints, purines in purine nu¬ 
cleotides are restricted to two stable conformations with 
respect to deoxyribose, called syn and anti (F ig. 8-18b). 
Pyrimidines are generally restricted to the anti confor¬ 
mation because of steric interference between the sugar 
and the carbonyl oxygen at C-2 of the pyrimidine. 


The Watson-Crick structure is also referred to as B- 
form DNA, or B-DNA. The B form is the most stable 
structure for a random-sequence DNA molecule under 
physiological conditions and is therefore the standard 
point of reference in any study of the properties of DNA. 
Two structural variants that have been well character¬ 
ized in crystal structures are the A and Z forms. These 
three DNA conformations are shown in Figure 8-19, 
with a summary of their properties. The A form is fa¬ 
vored in many solutions that are relatively devoid of wa¬ 
ter. The DNA is still arranged in a right-handed double 
helix, but the helix is wider and the number of base pairs 
per helical turn is 11, rather than 10.5 as in B-DNA. The 




A form 
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Z form 

Helical sense 

Right handed 

Right handed 

Left handed 

Diameter 

Base pairs per helical 

—26 A 

-20 A 

—18 A 

turn 

11 

10.5 

12 

Helix rise per base pair 

Base tilt normal to the 

2.6 A 

3.4 A 

3.7 A 

helix axis 

20° 

6° 

7° 

Sugar pucker conformation 
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C-2' endo for 
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FIGURE 8-19 Comparison of A, B, and Z forms of DNA. Each struc- 
ture shown here has 36 base pairs. The bases are shown in gray, the 
phosphate atoms in yellow, and the riboses and phosphate oxygens 
in blue. Blue is the color used to represent DNA strands in later chap¬ 
ters. The table summarizes some properties of the three forms of DNA. 
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plane of the base pairs in A-DN A is tilted about 20° with 
respect to the helix axis. These structural changes 
deepen the major groove while making the minor groove 
shallower. The reagents used to promote crystallization 
of DNA tend to dehydrate it, and thus most short DNA 
molecules tend to crystallize in the A form. 

Z-form DNA is a more radical departure from the B 
structure; the most obvious distinction is the left- 
handed helical rotation. There are 12 base pairs per hel¬ 
ical turn, and the structure appears more slender and 
elongated. The DNA backbone takes on a zigzag ap¬ 
pearance. Certain nucleotide seguences fold into left- 
handed Z helices much more readily than others. Promi¬ 
nent examples are sequences in which pyrimidines 
alternate with purines, especially alternating C and G or 
5-methyl-C and G residues. To form the left-handed 
helix in Z-DNA, the purine residues flip to the syn 
conformation, alternating with pyrimidines in the anti 
conformation. The major groove is barely apparent in 
Z-DNA, and the minor groove is narrow and deep. 

Whether A-DNA occurs in cells is uncertain, but there 
is evidence for some short stretches (tracts) of Z-DNA 
in both prokaryotes and eukaryotes. These Z-DNA tracts 
may play a role (as yet undefined) in regulating the ex¬ 
pression of some genes or in genetic recombination. 

Certain DNA Sequences Adopt Unusual Structures 

A number of other sequence-dependent structural vari¬ 
ations have been detected within larger chromosomes 
that may affect the function and metabolism of the DNA 
segments in their immediate vicinity. For example, 
bends occur in the DNA helix wherever four or more 
adenosine residues appear sequentially in one strand. 
Six adenosines in a row produce a bend of about 18°. 


P-nSin diviii'T 



T T AOCAEGTQCTAA 




A A T C □ T G C A C G A T T 


Minwr rapym 


* ** 


T T A n C A Cl JC A C G A T T 

i * l i i i * t i i i * t i 

A A T f: Ci T G G T G C T A A 


FIGURE 8-20 Palindromes and mirror repeats. Palindromes are se¬ 
quences of double-stranded nucleic acids with twofold symmetry. In 
order to superimpose one repeat (shaded sequence) on the other, it 
must be rotated 180° about the horizontal axis then 180° about the 
vertical axis, as shown by the colored arrows. A mirror repeat, on the 
other hand, has a symmetric sequence within each strand. Superim¬ 
posing one repeat on the other requires only a single 180° rotation 
about the vertical axis. 


The bending observed with this and other sequences may 
be important in the binding of some proteins to DNA. 

A rather common type of DNA sequence is a palin¬ 
drome. A palindrome is a word, phrase, or sentence 
that is spelled identically read either forward or back¬ 
ward; two examples are ROTATOR and NURSES RUN. 
The term is applied to regions of DNA with inverted 
repeats of base sequence having twofold symmetry 
over two strands of DNA (Fig. 8-20). Such sequences 
are self-complementary within each strand and there¬ 
fore have the potential to form hairpin or cruciform 
(cross-shaped) structures (Fig. 8-21). When the in¬ 
verted repeat occurs within each individual strand of 
the DNA, the sequence is called a mirror repeat. 
Mirror repeats do not have complementary sequences 
within the same strand and cannot form hairpin or cru¬ 
ciform structures. Sequences of these types are found 
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FIGURE 8-21 Hairpins and cruciforms. Palindromic DNA (or RNA) 
sequences can form alternative structures with intrastrand base pair¬ 
ing. (a) When only a single DNA (or RNA) strand is involved, the 
structure is called a hairpin, (b) When both strands of a duplex DNA 
are involved, it is called a cruciform. Blue shading highlights asym¬ 
metric sequences that can pair with the complementary sequence ei¬ 
ther in the same strand or in the complementary strand. 
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in virtually every large DNA molecule and can encom¬ 
pass a few base pairs or thousands. The extent to which 
palindromes occur as cruciforms in cells is not known, 
although some cruciform structures have been demon¬ 
strated in vivo in E.coli Self-complementary seguences 
cause isolated single strands of DNA (or RNA) in solu¬ 
tion to fold into complex structures containing multiple 
hairpins. 

Several unusual DNA structures involve three or even 
four DNA strands. These structural variations merit 
investigation because there is a tendency for many of 
them to appear at sites where important events in DNA 
metabolism (replication, recombination, transcription) 
are initiated or regulated. Nucleotides participating in a 


Watson-Crick base pair (Fig. 8-11) can form a number 
of additional hydrogen bonds, particularly with func¬ 
tional groups arrayed in the major groove. For example, 
a cytidine residue (if protonated) can pair with the 
guanosine residue of a G=C nucleotide pair, and a 
thymidine can pair with the adenosine of an A=T pair 
(Fig. 8-22). The N-7, O 6 , and A/ 6 of purines, the atoms 
that participate in the hydrogen bonding of triplex DNA, 
are often referred to as Hoogsteen positions, and the 
non-Watson-Crick pairing is called Hoogsteen pairing, 
after Karst FHoogsteen, who in 1963 first recognized the 
potential for these unusual pairings. FI oogsteen pairing 
allows the formation of triplex DNAs. The triplexes 
shown in Figure 8-22 (a, b) are most stable at low pH 
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FIGURE 8-22 DNA structures containing three or four DNA strands. 

(a) Base-pairing patterns in one well-characterized form of triplex 
DNA. The Hoogsteen pair in each case is shown in red. (b) Triple¬ 
helical DNA containing two pyrimidine strands (poly(T)) and one 
purine strand (poly(A)) (derived from PDB ID 1BCE). The dark blue 
and light blue strands are antiparallel and paired by normal Watson- 
Crick base-pairing patterns. The third (all-pyrimidine) strand (purple) 
is parallel to the purine strand and paired through non-Watson-Crick 
hydrogen bonds. The triplex is viewed end-on, with five triplets shown. 
0 nly the triplet closest to the viewer is colored, (c) Base-pairing pat¬ 
tern in the guanosine tetraplex structure. (d)Two successive tetraplets 
from a G tetraplex structure (derived from PDB ID 1QDG), viewed 
end-on with the one closest to the viewer in color, (e) Possible vari¬ 
ants in the orientation of strands in a G tetraplex. 
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because the C=G • C + triplet requires a protonated cy¬ 
tosine. In the triplex, the p K a of this cytosine is >7.5, 
altered from its normal value of 4.2. The triplexes also 
form most readily within long sequences containing only 
pyrimidines or only purines in a given strand. Some 
triplex DNAs contain two pyrimidine strands and one 
purine strand; others contain two purine strands and 
one pyrimidine strand. 

Four DNA strands can also pair to form a tetraplex 
(quadruplex), but this occurs readily only for DNA se¬ 
quences with a very high proportion of guanosine 
residues (Fig. 8-22c, d). The guanosine tetraplex, or G 
tetraplex, is quite stable over a wide range of condi¬ 
tions. The orientation of strands in the tetraplex can 
vary as shown in Figure 8-22e. 

A particularly exotic DNA structure, known as 
H-DNA, is found in polypyrimidine or polypurine tracts 
that also incorporate a mirror repeat. A simple example 
is a long stretch of alternating T and C residues (Fig. 
8-23). The FH-DNA structure features the triple-stranded 
form illustrated in Figure 8-22 (a, b). Two of the three 
strands in the FI-DNA triple helix contain pyrimidines 
and the third contains purines. 

In the DNA of living cells, sites recognized by many 
sequence-specific DNA-binding proteins (Chapter 28) 
are arranged as palindromes, and polypyrimidine or 
polypurine sequences that can form triple helices or 
even FI-DNA are found within regions involved in the 
regulation of expression of some eukaryotic genes. In 
principle, synthetic DNA strands designed to pair with 
these sequences to form triplex DNA could disrupt gene 
expression. This approach to controlling cellular me¬ 
tabolism is of growing commercial interest for its po¬ 
tential application in medicine and agriculture. 

Messenger RNAs Code for Polypeptide Chains 

We now turn our attention briefly from DNA structure 
to the expression of the genetic information that it con¬ 
tains. RNA, the second major form of nucleic acid in 
cells, has many functions. In gene expression, RNA acts 
as an intermediary by using the information encoded in 
DNA to specify the amino acid sequence of a functional 
protein. 

Given that the DNA of eukaryotes is largely con¬ 
fined to the nucleus whereas protein synthesis occurs 
on ribosomes in the cytoplasm, some molecule other 
than DNA must carry the genetic message from the nu¬ 
cleus to the cytoplasm. As early as the 1950s, RNA was 
considered the logical candidate: RNA is found in both 
the nucleus and the cytoplasm, and an increase in pro¬ 
tein synthesis is accompanied by an increase in the 
amount of cytoplasmic RNA and an increase in its rate 
of turnover. These and other observations led several 
researchers to suggest that RNA carries genetic infor¬ 
mation from DNA to the protein biosynthetic machin- 
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FIGURE 8-23 H-DNA. (a) A sequence of alternating! and C residues 
can be considered a mirror repeat centered about a central T or C. 
(b) These sequences form an unusual structure in which the strands 
in one half of the mirror repeat are separated and the pyrimidine- 
containing strand (alternating! and C residues) folds back on the other 
half of the repeat to form a triple helix. The purine strand (alternating 
A and G residues) is left unpaired. This structure produces a sharp 
bend in the DNA. 


ery of the ribosome. In 1961F rangois J acob and J acques 
Monod presented a unified (and essentially correct) pic¬ 
ture of many aspects of this process. They proposed the 
name "messenger RN A" (mRN A) for that portion of the 
total cellular RNA carrying the genetic information from 
DNA to the ribosomes, where the messengers provide 
the templates that specify amino acid sequences in 
polypeptide chains. Although mRNAs from different 
genes can vary greatly in length, the mRNAs from a par¬ 
ticular gene generally have a defined size. The process 
of forming mRNA on a DNA template is known as 
transcription. 

In prokaryotes, a single mRNA molecule may code for 
one or several polypeptide chains. If it carries the code 
for only one polypeptide, the mRNA is monocistronic; 
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FIGURE 8-24 Prokaryotic mRNA. Schematic diagrams show (a) 
monocistronic and (b) polycistronic mRNAs of prokaryotes. Red seg¬ 
ments represent RNA coding for a gene product; gray segments rep¬ 
resentnoncoding RNA. In the polycistronic transcript, noncoding RNA 
separates the three genes. 


if it codes for two or more different polypeptides, the 
mRNA is polycistronic. In eukaryotes, most mRNAs 
are monocistronic. (For the purposes of this discussion, 
"cistron" refers to a gene. The term itself has historical 
roots in the science of genetics, and its formal genetic 
definition is beyond the scope of this text.) The mini¬ 
mum length of an mRNA is set by the length of the 
polypeptide chain for which it codes. For example, a 
polypeptide chain of 100 amino acid residues requires 
an RNA coding sequence of at least 300 nucleotides, be¬ 
cause each amino acid is coded by a nucleotide triplet 
(this and other details of protein synthesis are discussed 
in Chapter 27). Flowever, mRNAs transcribed from DNA 
are always somewhat longer than the length needed sim¬ 
ply to code for a polypeptide sequence (or sequences). 
The additional, noncoding RNA includes sequences that 
regulate protein synthesis. Figure 8-24 summarizes the 
general structure of prokaryotic mRNAs. 

Many RNAs Have More Complex 
Three-Dimensional Structures 

Messenger RNA is only one of several classes of cellu¬ 
lar RNA. Transfer RNAs serve as adapter molecules in 
protein synthesis; covalently linked to an amino acid at 
one end, they pair with the mRNA in such a way that 
amino acids are joined to a growing polypeptide in the 
correct sequence. Ribosomal RNAs are components of 
ribosomes. There is also a wide variety of special-func¬ 
tion RNAs, including some (called ribozymes) that have 
enzymatic activity. All the RNAs are considered in de¬ 
tail in Chapter 26. The diverse and often complex func¬ 
tions of these RNAs reflect a diversity of structure much 
richer than that observed in DNA molecules. 

The product of transcription of DNA is always 
single-stranded RNA. The single strand tends to assume 
a right-handed helical conformation dominated by base¬ 
stacking interactions (F ig. 8-25), which are stronger be¬ 
tween two purines than between a purine and pyrimi¬ 
dine or between two pyrimidines. The purine-purine 
interaction is so strong that a pyrimidine separating two 
purines is often displaced from the stacking pattern so 


that the purines can interact. Any self-complementary 
sequences in the molecule produce more complex struc¬ 
tures. RNA can base-pair with complementary regions 
of either RNA or DNA. Base pairing matches the pat¬ 
tern for DNA: G pairs with C and A pairs with U (or with 
the occasional T residue in some RNAs). One difference 
is that base pairing between G and U residues—unusual 
in DNA—is fairly common in RNA (see Fig. 8-27). The 
paired strands in RNA or RNA-DNA duplexes are an¬ 
tiparallel, as in DNA. 

RNA has no simple, regular secondary structure 
that serves as a reference point, as does the double he¬ 
lix for DNA. The three-dimensional structures of many 
RNAs, like those of proteins, are complex and unique. 
Weak interactions, especially base-stacking interactions, 
play a major role in stabilizing RNA structures, just as 
they do in DNA. Where complementary sequences are 
present, the predominant double-stranded structure is 
an A-form right-handed double helix. Z-form helices 
have been made in the laboratory (under very high-salt 
or high-temperature conditions). The B form of RNA 
has not been observed. Breaks in the regular A-form he¬ 
lix caused by mismatched or unmatched bases in one 
or both strands are common and result in bulges or in¬ 
ternal loops (Fig. 8-26). Flairpin loops form between 
nearby self-complementary sequences. The potential for 
base-paired helical structures in many RNAs is exten¬ 
sive (F ig. 8-27), and the resulting hairpins are the most 
common type of secondary structure in RNA. Specific 



FIGURE 8-25 Typical right-handed stacking pattern of single- 
stranded RNA. The bases are shown in gray, the phosphate atoms in 
yellow, and the ribosesand phosphate oxygens in green. Green is used 
to represent RNA strands in succeeding chapters, just as blue is used 
for DNA. 
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FIGURE 8-26 Secondary structure of RNAs. (a) Bulge, internal loop, 
and hairpin loop, (b) The paired regions generally have an A-form 
right-handed helix, as shown for a hairpin. 


short base sequences (such as UUCG) are often found 
at the ends of RNA hairpins and are known to form par¬ 
ticularly tight and stable loops. Such sequences may act 
as starting points for the folding of an RNA molecule 
into its precise three-dimensional structure. Important 
additional structural contributions are made by hydro¬ 
gen bonds that are not part of standard Watson-Crick 
base pairs. For example, the 2'-hydroxyl group of ribose 
can hydrogen-bond with other groups. Some of these 
properties are evident in the structure of the phenyl¬ 
alanine transfer RNA of yeast—the tRNA responsible for 
inserting Phe residues into polypeptides—and in two 
RNA enzymes, or ribozymes, whose functions, like those 
of protein enzymes, depend on their three-dimensional 
structures (Fig. 8-28). 

The analysis of RNA structure and the relationship 
between structure and function is an emerging field of 
inquiry that has many of the same complexities as the 
analysis of protein structure. The importance of under¬ 
standing RNA structure grows as we become increas¬ 
ingly aware of the large number of functional roles for 
RNA molecules. 


FIGURE 8-27 Base-paired helical structures in an RNA. Shown 
here is the possible secondary structure of the M1 RN A component 
of the enzyme RNase P of E. coli, with many hairpins. RNase P, 
which also contains a protein component (not shown), functions in 
the processing of transfer RNAs (see Fig. 26-23). The two brackets 
indicate additional complementary sequences that may be paired in 
the three-dimensional structure. The blue dots indicate non-Watson- 
Crick G=U base pairs (boxed inset). Note that G=U base pairs are 
allowed only when presynthesized strands of RNA fold up or anneal 
with each other. There are no RNA polymerases (the enzymes that 
synthesize RNAs on a DNA template) that insert a U opposite a 
template G, or vice versa, during RNA synthesis. 
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FIGURE 8-28 Three-dimensional structure in RNA. (a) Three- 
dimensional structure of phenylalanine tRN A of yeast (PD B ID 1TRA). 
Some unusual base-pairing patterns found in this tRNA are shown. 
Note also the involvement of the oxygen of a ribose phosphodiester 
bond in one hydrogen-bonding arrangement, and a ribose 2'-hydroxyl 
group in another (both in red). (b)A hammerhead ribozyme (so named 
because the secondary structure at the active site looks like the head 
of a hammer), derived from certain plant viruses (derived from PDB 


ID 1MME). Ribozymes, or RNA enzymes, catalyze a variety of reac¬ 
tions, primarily in RNA metabolism and protein synthesis. The com¬ 
plex three-dimensional structures of these RN As reflect the complexity 
inherent in catalysis, as described for protein enzymes in Chapter 6. 
(c) A segment of mRNA known as an intron, from the ciliated proto¬ 
zoan Tetrahymena thermophila (derived from PDB ID 1GRZ). This 
intron (a ribozyme) catalyzes its own excision from between exons in 
an mRNA strand (discussed in Chapter 26). 


SUMMARY 8.2 Nucleic Acid Structure 


■ Many lines of evidence show that DNA bears 
genetic information. In particular, the Avery- 
MacLeod-McCarty experiment showed that DNA 
isolated from one bacterial strain can enter and 
transform the cells of another strain, endowing 
it with some of the inheritable characteristics 
of the donor. The Hershey-Chase experiment 


showed that the DNA of a bacterial virus, but 
not its protein coat, carries the genetic 
message for replication of the virus in a host cell. 

■ Putting together much published data, Watson 
and Crick postulated that native DNA consists 
of two antiparallel chains in a right-handed 
double-helical arrangement. Complementary 
base pairs, A=T and G=C, are formed by 
hydrogen bonding within the helix. The base 
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pairs are stacked perpendicular to the long axis 
of the double helix, 3.4 A apart, with 10.5 base 
pairs per turn. 

■ DNA can exist in several structural forms. Two 
variations of the Watson-Crick form, or B-DNA, 
are A- and Z-DNA. Some sequence-dependent 
structural variations cause bends in the DNA 
molecule. DNA strands with appropriate se¬ 
quences can form hairpin/cruciform structures 
or triplex or tetraplex DNA. 

■ Messenger RNA transfers genetic information 
from DNA to ribosomes for protein synthesis. 
Transfer RNA and ribosomal RNA are also 
involved in protein synthesis. RNA can be 
structurally complex; single RNA strands can 
be folded into hairpins, double-stranded re¬ 
gions, or complex loops. 


8.3 Nucleic Acid Chemistry 

To understand how nucleic acids function, we must un¬ 
derstand their chemical properties as well as their struc¬ 
tures. The role of DNA as a repository of genetic infor¬ 
mation depends in part on its inherent stability. The 
chemical transformations that do occur are generally very 
slow in the absence of an enzyme catalyst. The long-term 
storage of information without alteration is so important 
to a cell, however, that even very slow reactions that 
alter DNA structure can be physiologically significant. 
Processes such as carcinogenesis and aging may be 
intimately linked to slowly accumulating, irreversible al¬ 
terations of DNA. Other, nondestructive alterations also 
occur and are essential to function, such as the strand 
separation that must precede DNA replication or tran¬ 
scription. In addition to providing insights into physio¬ 
logical processes, our understanding of nucleic acid 
chemistry has given us a powerful array of technologies 
that have applications in molecular biology, medicine, and 
forensic science. We now examine the chemical proper¬ 
ties of DNA and some of these technologies. 

Double-Helical DNA and RNA Can Be Denatured 

Solutions of carefully isolated, native DNA are highly 
viscous at pH 7.0 and room temperature (25 °C). When 
such a solution is subjected to extremes of pH or to tem¬ 
peratures above 80 °C, its viscosity decreases sharply, 
indicating that the DNA has undergone a physical 
change. J ust as heat and extremes of pH denature glob¬ 
ular proteins, they also cause denaturation, or melting, 
of double-helical DNA. Disruption of the hydrogen 
bonds between paired bases and of base stacking causes 
unwinding of the double helix to form two single strands, 
completely separate from each other along the entire 


length or part of the length ( partial denaturation) of the 
molecule. No covalent bonds in the DNA are broken 
(Fig. 8-29). 

Renaturation of a DNA molecule is a rapid one-step 
process, as long as a double-helical segment of a dozen 
or more residues still unites the two strands. When the 
temperature or pH is returned to the range in which 
most organisms live, the unwound segments of the two 
strands spontaneously rewind, or anneal, to yield the 
intact duplex (Fig. 8-29). However, if the two strands 
are completely separated, renaturation occurs in two 
steps. In the first, relatively slow step, the two strands 
"find" each other by random collisions and form a short 
segment of complementary double helix. The second 
step is much faster: the remaining unpaired bases suc¬ 
cessively come into register as base pairs, and the two 
strands "zipper" themselves together to form the dou¬ 
ble helix. 

The close interaction between stacked bases in a 
nucleic acid has the effect of decreasing its absorption 
of UV light relative to that of a solution with the same 
concentration of free nucleotides, and the absorption is 
decreased further when two complementary nucleic 
acids strands are paired. This is called the hypochromic 
effect. Denaturation of a double-stranded nucleic acid 
produces the opposite result: an increase in absorption 
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FIGURE 8-29 Reversible denaturation and annealing (renaturation) 
of DNA. 
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called the hyperchromic effect. The transition from 
double-stranded DNA to the single-stranded, denatured 
form can thus be detected by monitoring the absorption 
of UV light. 

Viral or bacterial DNA molecules in solution dena¬ 
ture when they are heated slowly (Fig. 8-30). Each 
species of DNA has a characteristic denaturation tem¬ 
perature, or melting point (t m ): the higher its content 
of G=C base pairs, the higher the melting point of the 
DNA. This is because G=C base pairs, with three hy¬ 
drogen bonds, reguire more heat energy to dissociate 
than A=T base pairs. Careful determination of the melt¬ 
ing point of a DNA specimen, under fixed conditions of 
phi and ionic strength, can yield an estimate of its base 
composition. If denaturation conditions are carefully 
controlled, regions that are rich in A=T base pairs will 
specifically denature while most of the DNA remains 




FIGURE 8-30 Heat denaturation of DNA. (a) The denaturation, or 
melting, curves of two DNA specimens. The temperature at the mid¬ 
point of the transition (t m ) is the melting point; it depends on pH and 
ionic strength and on the size and base composition of the DNA. 
(b) Relationship between t m and the G=C content of a DNA. 
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FIGURE 8-31 Partially denatured DNA. This DNA was partially de¬ 
natured, then fixed to prevent re natu rati on during sample preparation. 
The shadowing method used to visualize the DNA in this electron mi¬ 
crograph increases its diameter approximately fivefold and obliterates 
most details of the helix. However, length measurements can be ob¬ 
tained, and single-stranded regions are readily distinguishable from 
double-stranded regions. The arrows point to some single-stranded 
bubbles where denaturation has occurred. The regions that denature 
are highly reproducible and are rich in A=T base pairs. 

double-stranded. Such denatured regions (called bub¬ 
bles) can be visualized with electron microscopy (Fig. 
8-31). Strand separation of DNA must occur in vivo dur¬ 
ing processes such as DNA replication and transcrip¬ 
tion. As we shall see, the DNA sites where these 
processes are initiated are often rich in A=T base pairs. 

Duplexes of two RNA strands or of one RNA strand 
and one DNA strand (RNA-DNA hybrids) can also be 
denatured. Notably, RNA duplexes are more stable than 
DNA duplexes. At neutral pH, denaturation of a double- 
helical RNA often reguires temperatures 20 °C or more 
higher than those reguired for denaturation of a DNA 
molecule with a comparable seguence. The stability of 
an RNA-DNA hybrid is generally intermediate between 
that of RNA and that of DNA. The physical basis for 
these differences in thermal stability is not known. 

Nucleic Acids from Different Species 
Can Form Hybrids 

The ability of two complementary DNA strands to pair 
with one another can be used to detect similar DNA se- 
guences in two different species or within the genome 
of a single species. If duplex DNAs isolated from human 
cells and from mouse cells are completely denatured by 
heating, then mixed and kept at 65 °C for many hours, 
much of the DNA will anneal. Most of the mouse DNA 
strands anneal with complementary mouse DNA strands 
to form mouse duplex DNA; similarly, most human 
DNA strands anneal with complementary human DNA 
strands. However, some strands of the mouse DNA will 
associate with human DNA strands to yield hybrid 
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duplexes, in which segments of a mouse DNA strand 
form base-paired regions with segments of a human DNA 
strand (Fig. 8-32). This reflects a common evolutionary 
heritage; different organisms generally have some pro¬ 
teins and RNAs with similar functions and, often, simi¬ 
lar structures. In many cases, the DNAs encoding these 
proteins and RNAs have similar sequences. The closer 
the evolutionary relationship between two species, the 
more extensively their DNAs will hybridize. For exam¬ 
ple, human DNA hybridizes much more extensively with 
mouse DNA than with DNA from yeast. 

The hybridization of DNA strands from different 
sources forms the basis for a powerful set of techniques 
essential to the practice of modern molecular genetics. 
A specific DNA sequence or gene can be detected in the 
presence of many other sequences, if one already has 
an appropriate complementary DNA strand (usually la¬ 
beled in some way) to hybridize with it (Chapter 9). The 
complementary DNA can be from a different species or 
from the same species, or it can be synthesized chemi¬ 
cally in the laboratory using techniques described later 



FIGURE 8-32 DNA hybridization. Two DNA samples to becompared 
are completely denatured by heating. When the two solutions are 
mixed and slowly cooled, DNA strands of each sample associate with 
their normal complementary partner and anneal to form duplexes. If 
the two DNAs have significant sequence similarity, they also tend to 
form partial duplexes or hybrids with each other: the greater the se¬ 
quence similarity between the two DNAs, the greater the number of 
hybrids formed. Hybrid formation can be measured in several ways. 
0 ne of the DNAs is usually labeled with a radioactive isotope to sim¬ 
plify the measurements. 


in this chapter. FHybridization techniques can be varied 
to detect a specific RNA rather than DNA. The isolation 
and identification of specific genes and RNAs rely on 
these hybridization techniques. Applications of this 
technology make possible the identification of an indi¬ 
vidual on the basis of a single hair left at the scene of a 
crime or the prediction of the onset of a disease decades 
before symptoms appear (see Box 9-1). 

Nucleotides and Nucleic Acids Undergo 
Nonenzymatic Transformations 

Purines and pyrimidines, along with the nucleotides of 
which they are a part, undergo a number of spontaneous 
alterations in their covalent structure. The rate of these 
reactions is generally very slow, but they are physio¬ 
logically significant because of the cell's very low toler¬ 
ance for alterations in its genetic information. Alter¬ 
ations in DNA structure that produce permanent 
changes in the genetic information encoded therein are 
called mutations, and much evidence suggests an inti¬ 
mate link between the accumulation of mutations in an 
individual organism and the processes of aging and 
carcinogenesis. 

Several nucleotide bases undergo spontaneous loss 
of their exocyclic amino groups (deamination) (Fig. 
8-33a). For example, under typical cellular conditions, 
deamination of cytosine (in DNA) to uracil occurs in 
about one of every 10 7 cytidine residues in 24 hours. 
This corresponds to about 100 spontaneous events per 
day, on average, in a mammalian cell. Deamination of 
adenine and guanine occurs at about l/100th this rate. 

The slow cytosine deamination reaction seems in¬ 
nocuous enough, but is almost certainly the reason why 
DNA contains thymine rather than uracil. The product 
of cytosine deamination (uracil) is readily recognized as 
foreign in DNA and is removed by a repair system 
(Chapter 25). If DNA normally contained uracil, recog¬ 
nition of uracils resulting from cytosine deamination 
would be more difficult, and unrepaired uracils would 
lead to permanent sequence changes as they were 
paired with adenines during replication. Cytosine deam¬ 
ination would gradually lead to a decrease in G=C base 
pairs and an increase in A=U base pairs in the DNA of 
all cells. Over the millennia, cytosine deamination could 
eliminate G=C base pairs and the genetic code that de¬ 
pends on them. Establishing thymine as one of the four 
bases in DNA may well have been one of the crucial 
turning points in evolution, making the long-term stor¬ 
age of genetic information possible. 

Another important reaction in deoxyribonu- 
cleotides is the hydrolysis of the A/-/3-glycosyI bond be¬ 
tween the base and the pentose (Fig. 8-33b). This oc¬ 
curs at a higher rate for purines than for pyrimidines. 
As many as one in 10 5 purines (10,000 per mammalian 
cell) are lost from DNA every 24 hours under typical 
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cellular conditions. Depurination of ribonucleotides and 
RNA is much slower and generally is not considered 
physiologically significant. In the test tube, loss of 
purines can be accelerated by dilute acid. Incubation of 
DNA at pH 3 causes selective removal of the purine 
bases, resulting in a derivative called apurinic acid. 

Other reactions are promoted by radiation. UV light 
induces the condensation of two ethylene groups to 
form a cyclobutane ring. In the cell, the same reaction 
between adjacent pyrimidine bases in nucleic acids 
forms cyclobutane pyrimidine dimers. This happens 
most freguently between adjacent thymidine residues 
on the same DNA strand (Fig. 8-34). A second type of 
pyrimidine dimer, called a 6-4 photoproduct, is also 
formed during UV irradiation. Ionizing radiation (x rays 
and gamma rays) can cause ring opening and fragmen¬ 
tation of bases as well as breaks in the covalent back¬ 
bone of nucleic acids. 

Virtually all forms of life are exposed to energy-rich 
radiation capable of causing chemical changes in DNA. 
Near-UV radiation (with wavelengths of 200 to 400 nm), 
which makes up a significant portion of the solar spec¬ 
trum, is known to cause pyrimidine dimer formation and 
other chemical changes in the DNA of bacteria and of 
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FIGURE 8-33 Some well-characterized nonenzymatic reactions of 
nucleotides, (a) Deamination reactions. Only the base is shown, 
(b) Depurination, in which a purine is lost by hydrolysis of the N-p- 
glycosyl bond. The deoxyribose remaining after depurination is readily 
converted from the /3-furanose to the aldehyde form (see Fig. 8-3). Fur¬ 
ther nonenzymatic reactions are illustrated in Figures 8-34 and 8-35. 

human skin cells. We are subject to a constant field of 
ionizing radiation in the form of cosmic rays, which can 
penetrate deep into the earth, as well as radiation emit¬ 
ted from radioactive elements, such as radium, pluto¬ 
nium, uranium, radon, 14 C, and 3 H. X rays used in med¬ 
ical and dental examinations and in radiation therapy of 
cancer and other diseases are another form of ionizing 
radiation. It is estimated that UV and ionizing radiations 
are responsible for about 10% of all DNA damage caused 
by environmental agents. 

DNA also may be damaged by reactive chemicals in¬ 
troduced into the environment as products of industrial 
activity. Such products may not be injurious per se but 
may be metabolized by cells into forms that are. Two 
prominent classes of such agents (Fig. 8-35) are (1) 
deaminating agents, particularly nitrous acid (HN0 2 ) or 
compounds that can be metabolized to nitrous acid or 
nitrites, and (2) alkylating agents. 

Nitrous acid, formed from organic precursors such 
as nitrosamines and from nitrite and nitrate salts, is a 
potent accelerator of the deamination of bases. Bisulfite 
has similar effects. Both agents are used as preserva¬ 
tives in processed foods to prevent the growth of toxic 
bacteria. They do not appear to increase cancer risks 
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significantly when used in this way, perhaps because 
they are used in small amounts and make only a minor 
contribution to the overall levels of DNA damage. (The 
potential health risk from food spoilage if these preser¬ 
vatives were not used is much greater.) 

Alkylating agents can alter certain bases of DNA. 
For example, the highly reactive chemical dimethylsul- 
fate (Fig. 8-35b) can methylate a guanine to yield O 6 - 
methylguanine, which cannot base-pair with cytosine. 


I ELUgtDTJlLTK 



Many similar reactions are brought about by alkylating 
agents normally present in cells, such as S-adenosyl- 
methionine. 


Possibly the most important source of mutagenic al¬ 
terations in DNA is oxidative damage. Excited-oxygen 
species such as hydrogen peroxide, hydroxyl radicals, 
and superoxide radicals arise during irradiation or as a 
byproduct of aerobic metabolism. Of these species, the 
hydroxyl radicals are responsible for most oxidative 
DNA damage. Cells have an elaborate defense system 
to destroy reactive oxygen species, including enzymes 
such as catalase and superoxide dismutase that convert 
reactive oxygen species to harmless products. A frac¬ 
tion of these oxidants inevitably escape cellular de¬ 
fenses, however, and damage to DNA occurs through 
any of a large, complex group of reactions ranging from 
oxidation of deoxyribose and base moieties to strand 
breaks. Accurate estimates for the extent of this dam¬ 
age are not yet available, but every day the DNA of each 
human cell is subjected to thousands of damaging 
oxidative reactions. 

This is merely a sampling of the best-understood 
reactions that damage DNA. Many carcinogenic com¬ 
pounds in food, water, or air exert their cancer-causing 
effects by modifying bases in DNA. Nevertheless, the in¬ 
tegrity of DNA as a polymer is better maintained than 
that of either RNA or protein, because DNA is the only 
macromolecule that has the benefit of biochemical repair 
systems. These repair processes (described in Chapter 
25) greatly lessen the impact of damage to DNA. 
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FIGURE 8-35 Chemical agents that cause 
DNA damage, (a) Precursors of nitrous acid, 
which promotes deamination reactions. 

(b) Alkylating agents. 



Some Bases of DNA Are Methylated 

Certain nucleotide bases in DNA molecules are enzy¬ 
matically methylated. Adenine and cytosine are methy¬ 
lated more often than guanine and thymine. Methyla¬ 
tion is generally confined to certain sequences or 
regions of a DNA molecule. In some cases the function 
of methylation is well understood; in others the func¬ 
tion remains unclear. All known DNA methylases use S- 
adenosylmethionine as a methyl group donor. E. coli 
has two prominent methylation systems. One serves as 
part of a defense mechanism that helps the cell to dis¬ 
tinguish its DNA from foreign DNA by marking its 
own DNA with methyl groups and destroying (foreign) 
DNA without the methyl groups (this is known as a 
restriction-modification system; see Chapter 9). The 
other system methylates adenosine residues within the 
sequence (5')GATC(3') to /\/ 6 -methyladenosine (Fig. 
8-5a). This is mediated by the Dam (DNA adenine 
methylation) methylase, a component of a system that 
repairs mismatched base pairs formed occasionally dur¬ 
ing DNA replication (see Fig. 25-20). 

In eukaryotic cells, about 5% of cytidine residues in 
DNA are methylated to 5-methylcytidine (Fig. 8-5a). 
Methylation is most common at CpG sequences, pro¬ 
ducing methyl-CpG symmetrically on both strands of the 
DNA. The extent of methylation of CpG sequences 
varies by molecular region in large eukaryotic DNA mol¬ 
ecules. Methylation suppresses the migration of seg¬ 
ments of DNA called transposons, described in Chapter 
25. These methylations of cytosine also have structural 
significance. The presence of 5-methylcytosine in an al¬ 
ternating CpG sequence markedly increases the ten¬ 
dency for that segment of DNA to assume the Z form. 

The Sequences of Long DNA Strands 
Can Be Determined 

In its capacity as a repository of information, a DNA mol¬ 
ecule's most important property is its nucleotide se¬ 


quence. Until the late 1970s, determining the sequence 
of a nucleic acid containing even five or ten nucleotides 
was difficult and very laborious. The development of two 
new techniques in 1977, one by Alan Maxam and Walter 
Gilbert and the other by F rederick Sanger, has made pos¬ 
sible the sequencing of ever larger DNA molecules with 
an ease unimagined just a few decades ago. The tech¬ 
niques depend on an improved understanding of nu¬ 
cleotide chemistry and DNA metabolism, and on elec¬ 
trophoretic methods for separating DNA strands differing 
in size by only one nucleotide. Electrophoresis of DNA is 
similar to that of proteins (see Fig. 3-19). Polyacrylamide 
is often used as the gel matrix in work with short DNA 
molecules (up to a few hundred nucleotides); agarose is 
generally used for longer pieces of DNA. 

In both Sanger and Maxam-GiIbert sequencing, the 
general principle is to reduce the DNA to four sets of la¬ 
beled fragments. The reaction producing each set is 
base-specific, so the lengths of the fragments correspond 
to positions in the DNA sequence where a certain base 
occurs. For example, for an oligonucleotide with the se¬ 
quence pAATCGACT, labeled at the 5' end (the left end), 
a reaction that breaks the DNA after each C residue will 
generate two labeled fragments: a four-nucleotide and a 
seven-nucleotide fragment; a reaction that breaks the 
DNA after each G will produce only one labeled, five- 
nucleotide fragment. Because the fragments are radio- 
actively labeled at their 5' ends, only the fragment to the 
5' side of the break is visualized. The fragment sizes cor¬ 
respond to the relative positions of C and G residues in 
the sequence. When the sets of fragments corresponding 
to each of the four bases are electrophoretically sepa¬ 
rated side by side, they produce a ladder of bands from 
which the sequence can be read directly (Fig. 8-36). We 
illustrate only the Sanger method, because it has proven 
to be technically easier and is in more widespread use. 
It requires the enzymatic synthesis of a DNA strand com¬ 
plementary to the strand under analysis, using a radio- 
actively labeled "primer" and dideoxynucleotides. 















8.3 Nucleic Acid Chemistry 


297 


Primer 

strand 



3'ACGGCT ACT 


Template 



rr 

O—F—O- 


O' 0“ 

V— O— P— O— CHg 




tb'l 



B H 


iMNTP OTjilri,p 




s 





on 

CTAAGCTOOACT 


ThnniplEEta 


dCTP, atiTF, dATP. ifT 


Ilf 


+ UATP 

■ ttt.Tt LI JA a Cl CkS ILA 
■GaTTuOtIiIa 
■UJi'h 



■flATTCflAtadde 
—dAlTMC 


I— flAlTOflAtaCTddO 
h-GA’TTtflAi] JO- 


ft\TH3[lfcdiJdl 

ftvftsJf 


a 



ia 

EE 

to 

€ 

T 

6 

E 

■E 

B 





AuLtiiudii^nsn rf 
£IsKl3 si phPS 0c3 


Z 
A 
a 
T 
il 
■: 

A 

ft 

ft 

T 

T 

A 

ft 

SofUBCBlf 
li cot p I u m m 1 a i y 
■Liund 


FIGURE 8-36 DNA sequencing by the Sanger method. This method 
makes use of the mechanism of DNA synthesis by DNA polymerases 
(Chapter 25). (a) DNA polymerases require both a primer (a short 
oligonucleotide strand), to which nucleotides are added, and a template 
strand to guide selection of each new nucleotide. In cells, the 3'-hy- 

ts with an incoming deoxynucleoside 
n phosphodiester bond, (b) The Sanger 
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sis. (The Sanger method is also known 
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DNA sequencing is readily automated by a varia¬ 
tion of Sanger's sequencing method in which the 
dideoxynucleotides used for each reaction are labeled 
with a differently colored fluorescent tag (Fig. 8-37). 
This technology allows DNA sequences containing thou¬ 
sands of nucleotides to be determined in a few hours. 
Entire genomes of many organisms have now been se¬ 
quenced (see Table 1-4), and many very large DNA- 
sequencing projects are in progress. Perhaps the most 
ambitious of these is the Fluman Genome Project, in 
which researchers have sequenced all 3.2 billion base 
pairs of the DNA in a human cell (Chapter 9). ^ Dideoxy 
Sequencing of DNA 

The Chemical Synthesis of DNA Has Been Automated 

Another technology that has paved the way for many 
biochemical advances is the chemical synthesis of 
oligonucleotides with any chosen sequence. The chem¬ 
ical methods for synthesizing nucleic acids were devel¬ 
oped primarily by FI. Gobind Khorana and his colleagues 
in the 1970s. Refinement and automation of these meth¬ 
ods have made possible the rapid and accurate synthe¬ 
sis of DNA strands. The synthesis is carried out with the 
growing strand attached to a solid support (Fig. 8-38), 
using principles similar to those used by Merrifield in 
peptide synthesis (see F ig. 3-29). The efficiency of each 
addition step is very high, allowing the routine labora¬ 
tory synthesis of polymers containing 70 or 80 nu¬ 
cleotides and, in some laboratories, much longer 
strands. The availability of relatively inexpensive DNA 
polymers with predesigned sequences is having a pow¬ 
erful impact on all areas of biochemistry (Chapter 9). 


FIGURE 8-37 Strategy for automating DNA sequencing reactions. 

Each dideoxynucleotide used in the Sanger method can be linked to 
a fluorescent molecule that gives all the fragments terminating in that 
nucleotide a particular color. All four labeled ddNTPs are added to a 
single tube. The resulting colored DNA fragments are then separated 
by size in a single electrophoretic gel contained in a capillary tube (a 
refinement of gel electrophoresis that allows for faster separations). All 
fragments of a given length migrate through the capillary gel in a single 
peak, and the color associated with each peak is detected using a laser 
beam. The DNA sequence is read by determining the sequence of 
colors in the peaks as they pass the detector. This information is fed 
directly to a computer, which determines the sequence. 
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FIGURE 8-38 Chemical synthesis of DNA. Automated DNA synthe¬ 
sis is conceptually similar to the synthesis of polypeptides on a solid 
support. The oligonucleotide is built up on the solid support (silica), 
one nucleotide at a time, in a repeated series of chemical reactions 
with suitably protected nucleotide precursors. (T) The first nucleoside 
(which will be the 3' end) is attached to the silica support at the 3' 
hydroxyl (through a linking group, R) and is protected at the 5' hy¬ 
droxyl with an acid-labile dimethoxytrityl group (DMT). The reactive 
groups on all bases are also chemically protected. @ The protecting 
DMT group is removed by washing the column with acid (the DMT 
group is colored, so this reaction can be followed spectrophotomet- 
rically). (3) The next nucleotide is activated with a diisopropylamino 


group and reacted with the bound nucleotide to form a 5',3' linkage, 
which in step (4) is oxidized with iodine to produce a phosphotri- 
ester linkage. (One of the phosphate oxygens carries a cyanoethyl pro¬ 
tecting group.) Reactions @ through (4) are repeated until all nu¬ 
cleotides are added. At each step, excess nucleotide is removed before 
addition of the next nucleotide. In steps (5) and (6) the remaining 
protecting groups on the bases and the phosphates are removed, and 
in (7) the oligonucleotide is separated from the solid support and pu¬ 
rified. The chemical synthesis of RN A is somewhat more complicated 
because of the need to protect the 2' hydroxyl of ribose without ad¬ 
versely affecting the reactivity of the 3' hydroxyl. 
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SUMMARY 8.3 Nucleic Acid Chemistry 


■ Native DNA undergoes reversible unwinding 
and separation of strands (melting) on heating 
or at extremes of pH. DNAs rich in G=C pairs 
have higher melting points than DNAs rich in 
A=T pairs. 

■ Denatured single-stranded DNAs from two 
species can form a hybrid duplex, the degree 
of hybridization depending on the extent of 
sequence similarity. Hybridization is the basis 
for important techniques used to study and 
isolate specific genes and RNAs. 

■ DNA is a relatively stable polymer. Spontaneous 
reactions such as deamination of certain bases, 
hydrolysis of base-sugar N -glycosyl bonds, 
radiation-induced formation of pyrimidine 
dimers, and oxidative damage occur at very low 
rates, yet are important because of cells' very 
low tolerance for changes in genetic material 

■ DNA sequences can be determined and DNA 
polymers synthesized with simple, automated 
protocols involving chemical and enzymatic 
methods. 


8.4 Other Functions of Nucleotides 

In addition to their roles as the subunits of nucleic acids, 
nucleotides have a variety of other functions in every 
cell: as energy carriers, components of enzyme cofac¬ 
tors, and chemical messengers. 

Nucleotides Carry Chemical Energy in Cells 

The phosphate group covalently linked at the 5' hy¬ 
droxyl of a ribonucleotide may have one or two addi¬ 
tional phosphates attached. The resulting molecules are 
referred to as nucleoside mono-, di-, and triphosphates 
(Fig. 8-39). Starting from the ribose, the three phos- 
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FIGURE 8-40 The phosphate ester and phosphoanhydride bonds of 

ATP. Hydrolysis of an anhydride bond yields more energy than hy¬ 
drolysis of the ester. A carboxylic acid anhydride and carboxylic acid 
ester are shown for comparison. 
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phates are generally labeled a, / 3 , and y. Hydrolysis of 
nucleoside triphosphates provides the chemical energy 
to drive a wide variety of cellular reactions. Adenosine 
5'-triphosphate, ATP, is by far the most widely used for 
this purpose, but UTP, GTP, and CTP are also used in 
some reactions. Nucleoside triphosphates also serve as 
the activated precursors of DNA and RNA synthesis, as 
described in Chapters 25 and 26. 

The energy released by hydrolysis of ATP and the 
other nucleoside triphosphates is accounted for by the 
structure of the triphosphate group. The bond between 
the ribose and the a phosphate is an ester linkage. The 
a ,/3 and / 3 ,y linkages are phosphoanhydrides (Fig. 
8-40). Hydrolysis of the ester linkage yields about 14 
kj/mol under standard conditions, whereas hydrolysis 
of each anhydride bond yields about 30 kj /mol. ATP hy¬ 
drolysis often plays an important thermodynamic role 
in biosynthesis. When coupled to a reaction with a pos¬ 
itive free-energy change, ATP hydrolysis shifts the equi¬ 
librium of the overall process to favor product forma- 
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FIGURE 8-39 Nucleoside phosphates. General structure of the nucleoside 
5'-mono-, di-, and triphosphates (NMPs, NDPs, and NTPs) and their standard 
abbreviations. In the deoxyribonucleoside phosphates (dNMPs, dNDPs, and 
dNTPs), the pentose is 2'-deoxy-D-ribose. 
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tion (recall the relationship between equilibrium con¬ 
stant and tree-energy change described by Eqn 6-3 on 
p. 195). 

Adenine Nucleotides Are Components of Many 
Enzyme Cofactors 

A variety of enzyme cofactors serving a wide range of 
chemical functions include adenosine as part of their 
structure (Fig. 8-41). They are unrelated structurally 
except for the presence of adenosine. In none of these 
cofactors does the adenosine portion participate directly 
in the primary function, but removal of adenosine gen¬ 
erally results in a drastic reduction of cofactor activi¬ 
ties. For example, removal of the adenine nucleotide 
(3'-phosphoadenosine diphosphate) from acetoacetyl- 


CoA, the coenzyme A derivative of acetoacetate, re¬ 
duces its reactivity as a substrate for /3-ketoacyl-CoA 
transferase (an enzyme of lipid metabolism) by a factor 
of 10 6 . Although this requirement for adenosine has not 
been investigated in detail, it must involve the binding 
energy between enzyme and substrate (or cofactor) that 
is used both in catalysis and in stabilizing the initial 
enzyme-substrate complex (Chapter 6). In the case of 
/3-ketoacyl-CoA transferase, the nucleotide moiety of 
coenzyme A appears to be a binding "handle" that helps 
to pull the substrate (acetoacetyl-CoA) into the active 
site. Similar roles may be found for the nucleoside por¬ 
tion of other nucleotide cofactors. 

Why is adenosine, rather than some other large mol¬ 
ecule, used in these structures 7 The answer here may 
involve a form of evolutionary economy. Adenosine is 
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FIGURE 8-41 Some coenzymes containing adenosine. The 

adenosine portion is shaded in light red. Coenzyme A (CoA) 
functions in acyl group transfer reactions; the acyl group (such as 
the acetyl or acetoacetyl group) is attached to the CoA through a 
thioester linkage to the /3-mercaptoethylamine moiety. NAD + func¬ 
tions in hydride transfers, and FAD, the active form of vitamin B 2 
(riboflavin), in electron transfers. Another coenzyme incorporating 
adenosine is 5'-deoxyadenosylcobalamin, the active form of vita¬ 
min B 12 (see Box 17-2), which participates in intramolecular 
group transfers between adjacent carbons. 
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certainly not unique in the amount of potential binding 
energy it can contribute. The importance of adenosine 
probably lies not so much in some special chemical char¬ 
acteristic as in the evolutionary advantage of using one 
compound for multiple roles. Once ATP became the uni¬ 
versal source of chemical energy, systems developed to 
synthesize ATP in greater abundance than the other nu¬ 
cleotides; because it is abundant, it becomes the logical 
choice for incorporation into a wide variety of struc¬ 
tures. The economy extends to protein structure. A sin¬ 
gle protein domain that binds adenosine can be used in 
a wide variety of enzymes. Such a domain, called a 
nucleotide-binding fold, is found in many enzymes 
that bind ATP and nucleotide cofactors. 

Some Nucleotides Are Regulatory Molecules 

Cells respond to their environment by taking cues from 
hormones or other external chemical signals. The in¬ 
teraction of these extracellular chemical signals ("first 
messengers") with receptors on the cell surface often 
leads to the production of second messengers inside 
the cell, which in turn leads to adaptive changes in 
the cell interior (Chapter 12). Often, the second mes¬ 
senger is a nucleotide (Fig. 8-42). One of the most 
common is adenosine 3',5'-cyclic monophosphate 


(cyclic AMP, or cAMP), formed from ATP in a reac¬ 
tion catalyzed by adenylyl cyclase, an enzyme associ¬ 
ated with the inner face of the plasma membrane. Cyclic 
AMP serves regulatory functions in virtually every cell 
outside the plant kingdom. Guanosine 3',5'-cyclic mono¬ 
phosphate (cGMP) occurs in many cells and also has 
regulatory functions. 

Another regulatory nucleotide, ppGpp (Fig. 8-42), 
is produced in bacteria in response to a slowdown in 
protein synthesis during amino acid starvation. This nu¬ 
cleotide inhibits the synthesis of the rRNA and tRNA 
molecules (see Fig. 28-24) needed for protein synthe¬ 
sis, preventing the unnecessary production of nucleic 
acids. 

SUMMARY 8.4 Other Functions of Nucleotides 


■ ATP is the central carrier of chemical energy in 
cells. The presence of an adenosine moiety in a 
variety of enzyme cofactors may be related to 
binding-energy requirements. 

■ Cyclic AMP, formed from ATP in a reaction 
catalyzed by adenylyl cyclase, is a common 
second messenger produced in response to 
hormones and other chemical signals. 
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Terms in bold are defined in the glossary . 
gene 273 
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Problems 


1. Nucleotide Structure Which positions in a purine 
ring of a purine nucleotide in DNA have the potential to form 
hydrogen bonds but are not involved in Watson-Crick base 
pairing 7 

2. Base Sequence of Complementary DNA Strands 

One strand of a double-helical DNA has the seguence 
(5')GCGCAATATTT CT CA A A ATATT GCGC( 3'). Write the base 
seguence of the complementary strand. What special type of 
seguence is contained in this DNA segment 7 Does the dou¬ 
ble-stranded DNA have the potential to form any alternative 
structures 7 

3. DNA of the Human Body Calculate the weight in 
grams of a double-helical DNA molecule stretching from the 
earth to the moon (-320,000 km). The DNA double helix 
weighs about 1 x lCn 18 g per 1,000 nucleotide pairs; each 
base pair extends 3.4 A. For an interesting comparison, your 
body contains about 0.5 g of DNA 1 

4. DNA Bending Assume that a poly(A) tract five base 
pairs long produces a 20° bend in a DNA strand. Calculate 


the total (net) bend produced in a DNA if the center base 
pairs (the third of five) of two successive (dA) 5 tracts are lo¬ 
cated (a) 10 base pairs apart; (b) 15 base pairs apart. As¬ 
sume 10 base pairs per turn in the DNA double helix. 

5. Distinction between DNA Structure and RNA 
Structure Hairpins may form at palindromic seguences in 
single strands of either RNA or DNA. How is the helical struc¬ 
ture of a long and fully base-paired (except at the end) hair¬ 
pin in RNA different from that of a similar hairpin in DNA 7 

6. Nucleotide Chemistry The cells of many eukaryotic 
organisms have highly specialized systems that specifically 
repair G-T mismatches in DNA. The mismatch is repaired to 
form a G=C (not A=T) base pair. This G-T mismatch repair 
mechanism occurs in addition to a more general system that 
repairs virtually all mismatches. Can you suggest why cells 
might reguire a specialized system to repair G-T mismatches 7 

7. Nucleic Acid Structure Explain why the absorption 
of UV light by double-stranded DNA increases (hyperchromic 
effect) when the DNA is denatured. 
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8. Determination of Protein Concentration in a So¬ 
lution Containing Proteins and Nucleic Acids The con 

centration of protein or nucleic acid in a solution containing 
both can be estimated by using their different light absorp¬ 
tion properties: proteins absorb most strongly at 280 nm and 
nucleic acids at 260 nm. Their respective concentrations in a 
mixture can be estimated by measuring the absorbance (A) 
of the solution at 280 nm and 260 nm and using the table be¬ 
low, which gives F 280 / 2 6o, the ratio of absorbances at 280 and 
260 nm; the percentage of total mass that is nucleic acid; and 
a factor, F, that corrects the A 280 reading and gives a more 
accurate protein estimate. The protein concentration (in 
mg/ml) = F x y4 280 (assuming the cuvette is 1 cm wide). Cal¬ 
culate the protein concentration in a solution of A 280 = 0.69 
and A 260 = 0.94. 


Proportion of 

R 280/260 nucleic acid (%) F 


1.75 

0.00 

1.116 

1.63 

0.25 

1.081 

1.52 

0.50 

1.054 

1.40 

0.75 

1.023 

1.36 

1.00 

0.994 

1.30 

1.25 

0.970 

1.25 

1.50 

0.944 

1.16 

2.00 
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1.09 

2.50 
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1.03 

3.00 
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0.979 

3.50 

0.776 

0.939 

4.00 

0.743 

0.874 

5.00 

0.682 

0.846 

5.50 

0.656 

0.822 

6.00 

0.632 

0.804 

6.50 

0.607 

0.784 

7.00 

0.585 

0.767 

7.50 

0.565 

0.753 

8.00 

0.545 

0.730 

9.00 

0.508 

0.705 

10.00 

0.478 

0.671 

12.00 

0.422 

0.644 

14.00 

0.377 

0.615 

17.00 

0.322 

0.595 

20.00 

0.278 


9. Base Pairing in DNA In samples of DNA isolated from 
two unidentified species of bacteria, X and Y, adenine makes 
up 32% and 17%, respectively, of the total bases. What rela¬ 
tive proportions of adenine, guanine, thymine, and cytosine 
would you expect to find in the two DNA samples 7 What as¬ 
sumptions have you made 7 One of these species was isolated 
from a hot spring (64 °C). Suggest which species is the ther¬ 
mophilic bacterium. What is the basis for your answer 7 

10. Solubility of the Components of DNA Draw the fol¬ 
lowing structures and rate their relative solubilities in water 
(most soluble to least soluble): deoxyribose, guanine, phos¬ 
phate. How are these solubilities consistent with the three- 
dimensional structure of double-stranded DNA 7 


11. DNA Sequencing The following DNA fragment was 
seguenced by the Sanger method. The red asterisk indicates 
a fluorescent label. 

+1$ ****************** ;!■'jiVJi 

S n.mmrmnmi XTrXCUCAAGUkCATTA GA 

A sample of the DNA was reacted with DNA polymerase and 
each of the nucleotide mixtures (in an appropriate buffer) 
listed below. Dideoxynucleotides (ddNTPs) were added in 
relatively small amounts. 

1. dATP, dTTP, dCTP, dGTP, ddTTP 

2. dATP, dTTP, dCTP, dGTP, ddGTP 

3. dATP, dCTP, dGTP, ddTTP 

4. dATP, dTTP, dCTP, dGTP 

The resulting DNA was separated by electrophoresis on an 
agarose gel, and the fluorescent bands on the gel were 
located. The band pattern resulting from nucleotide mixture 
1 is shown below. Assuming that all mixtures were run on the 
same gel, what did the remaining lanes of the gel look like 7 



12. Snake Venom Phosphodiesterase An exonuclease is 
an enzyme that sequentially cleaves nucleotides from the end 
of a polynucleotide strand. Snake venom phosphodiesterase, 
which hydrolyzes nucleotides from the 3' end of any oligonu¬ 
cleotide with a free 3'-hydroxyl group, cleaves between the 
3' hydroxyl of the ribose or deoxyribose and the phosphoryl 
group of the next nucleotide. It acts on single-stranded DNA 
or RNA and has no base specificity. This enzyme was used in 
seguence determination experiments before the development 
of modern nucleic acid seguencing techmgues. What are the 
products of partial digestion by snake venom phosphodi¬ 
esterase of an oligonucleotide with the following seguence 7 

(5')GCGCCAUUGC(3')-OH 

13. Preserving DNA in Bacterial Endospores Bacter¬ 
ial endospores form when the environment is no longer con¬ 
ducive to active cell metabolism. The soil bacterium Bacillus 
subtilis, for example, begins the process of sporulation when 
one or more nutrients are depleted. The end product is a 
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small, metabolically dormant structure that can survive al¬ 
most indefinitely with no detectable metabolism. Spores have 
mechanisms to prevent accumulation of potentially lethal mu¬ 
tations in their DNA over periods of dormancy that can ex¬ 
ceed 1,000 years. B. si/jbt/7/sspores are much more resistant 
than the organism’s growing cells to heat, UV radiation, and 
oxidizing agents, all of which promote mutations. 

(a) One factor that prevents potential DNA damage in 
spores is their greatly decreased water content. How would 
this affect some types of mutations 7 

(b) E ndospores have a category of proteins called small 
acid-soluble proteins (SASPs) that bind to their DNA, pre¬ 
venting formation of cyclobutane-type dimers. What causes 
cyclobutane dimers, and why do bacterial endospores need 
mechanisms to prevent their formation 7 

Biochemistry on the Internet 

14. The Structure of DNA Elucidation of the three 
dimensional structure of DNA helped researchers understand 
how this molecule conveys information that can be faithfully 
replicated from one generation to the next. To see the second¬ 
ary structure of double-stranded DNA, go to the Protein Data 
Bank website (www.rcsb.org/pdb). Use the PDB identifiers 
listed below to retrieve the data pages for the two forms of DNA. 


Open the structures using RasMol or Chime, and use the dif¬ 
ferent viewing options to complete the following exercises. 

(a) Obtain the file for 141D, a highly conserved, re¬ 
peated DNA seguence from the end of the HIV-1 (the virus 
that causes AIDS) genome. Display the molecule as a stick 
or ball-and-stick structure. Identify the sugar-phosphate 
backbonefor each strand ofthe DNA duplex. Locateand iden¬ 
tify individual bases. Which is the 5' end of this molecule 7 
Locate the major and minor grooves. Is this a right- or left- 
handed helix 7 

(b) Obtain the file for 145D, a DNA with the Z confor¬ 
mation. Display the molecule as a stick or ball-and-stick struc¬ 
ture. Identify the sugar-phosphate backbone for each strand 
of the DNA duplex. Is this a right- or left-handed helix 7 

(c) To fully appreciate the secondary structure of DNA, 
select "Stereo” in the Options menu in the viewer. You will 
see two images of the DNA molecule. Sit with your nose ap¬ 
proximately 10 inches from the monitor and focus on the tip 
of your nose. In the background you should see three images 
of the DNA helix. Shift your focus from the tip of your nose 
to the middle image, which should appear three-dimensional. 
(Note that only one of the two authors can make this work.) 
For additional tips, see the Study Guide or the textbook web¬ 
site (www.whfreeman.com/lehnmger). 



